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ABSTRACT 

This paper provides a theoretical foundation for some common formulations of inverse problems in 
wave propagation, based on hyperbolic systems of linear integro-differential equations with bounded 
and measurable coefficients. The coefficients of these time-dependent partial differential equations 
respresent parametrically the spatially varying mechanical properties of materials. Rocks, manu- 
factured materials, and other wave propagation environments often exhibit spatial heterogeneity 
in mechanical properties at a wide variety of scales, and coefficient functions representing these 
properties must mimic this heterogeneity. We show how to choose domains (classes of nonsmooth 
coefficient functions) and data definitions (traces of weak solutions) so that optimization formula- 
tions of inverse wave problems satisfy some of the prerequisites for application of Newton's method 
and its relatives. These results follow from the properties of a class of abstract first-order evolution 
systems, of which various physical wave systems appear as concrete instances. Finite speed of 
propagation for linear waves with bounded, measurable mechanical parameter fields is one of the 
by-products of this theory. 
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INTRODUCTION 



Inverse problems for waves in heterogeneous materials occur in seismology, ultrasonic nondestructive 
evaluation and biomedical imaging, some electromagnetic imaging technologies, and elsewhere. 
The data of such problems are often idealized as traces of wave fields on space-time hypersurfaces 
(or surfaces of even lower dimension), and nonlinear least-squares (or other data- fitting) methods 
developed for their numerical solution, that is, for finding the coefficients in the systems of partial 
differential equations chosen to model the wave physics. Newton's method (or one of its relatives) 
is a natural choice of solution method for these optimization formulations, as its rapid convergence 
compensate s to some extent for the very large computational size of reasonable discretizations - see 



for example 



Epanomeritakis et al 



((20091 ) . 



Natural and man-made materials may be so heterogeneous that few limits can be placed on the 
spatial regularity of coefficients representing material param e ters ( f or the case of sedim entary rocks, 



see for instance (jWalden and Hoskenl . 



1986; 



Bourbie et al 



1987 



White et all 



19901 ) ). However, 



traces are well-defined only for fields possessing some space-time regularity, and optimal convergence 
of Newton- like methods requires that objective functions and constraints be regular functions of 
the model parameters, in some sense. Since nonsmoothness of the material parameter (coefficient) 
fields implies nonsmoothness of dynamical (solution) fields, it is a priori unclear that the required 
traces exist, or that the fields depend sufficiently smoothly on the material parameter (coefficient) 
functions to permit Newton-like optimization methods to be applied. While discretized models 
may a fortiori produce nominally well-defined data simulations and smooth objective (data misfit) 
functions, accurate approximation of the continuum fields implies that the continuum limit will 
determine the behavior of solution algorithms for the corresponding discretized inverse problems. 
Also, numerical solutions to discretized inverse problems respect the underlying continuum physics 
only insofar as they converge under refinement of the discretization (at least in principle), and such 
convergence generally requires that the continuum problems have well-behaved solutions. 

The theory presented here provides a mathematical foundation for some common idealizations 
of inverse problems in wave propagation and approaches to their solution, for material models of 
bounded and measurable dependence on space variables. Measurability means roughly the existence 
of averages over arbitrary volumes, which seems a reasonable requirement for physical parameter 
fields. Most such fields are bounded as a matter of principle (positivity of density) or observation 
(p-wave velocity range in metals,...). 



2 



The dynamical laws underlying such problems are symmetric hyperbolic systems, either differ- 
ential or integrodifferential, for a vector-valued fields u defined on a domain ft C R d (of course, 
typically d = 1,2 or 3). Denoting the dimension of the dynamical state vector u by k 6 N, the 
systems treated in this paper take the form 



a— + p(V)u + bu + q * u = f in ft x R; u = for t < 0, 



(1) 



in which a, b are k x k matrix-valued functions on ft, q is a k x fc matrix-valued function on x R 
with q(x, t) = for i < 0, and p(V) is a k x k matrix of constant-coefficient first-order differential 
operators in the space variables: 

d 



p(V)u = Y,PjS^, Pj £R kxk ,i = l,...,d. 



3=1 



dx 



(2) 



The right-hand side vector / is a A;-vector valued function on O x R, vanishing for t < and 
representing energy input to the system. 

Such a system is symmetric hyperbolic if a(x), q(x, t) and pj are symmetric for all x € £1, t £ R, 
respectively j = 1, d, and a(x) is uniformly positive-definite: C*, C* S R exist so that < C* < 
C*, and 

C*/ < o(x) < C*I, a. e. x € ft (3) 

This paper has three central objectives. First, we describe constraints on data (coefficients 
a, b,p, q and right-hand side /) and solution under which (UJ is well-posed, even when the coefficient 
functions a, b, q are permitted to be quite irregular in their dependence on x 6 ft - in fact, merely 
bounded and measureable. Second, we determine a sense in which the solution of ([TJ) is regular 
(C k , k > 0) as a function of its coefficients. Third, we characterize conditions under which traces 
of solutions on time-like hypersurfaces are well-defined, and regular as functions of the coefficients. 



As in the similar study of I Stolid (|2000l ) for second-order hyperbolic systems, we represent ([T]) as 
an instance of a class of abstract autonomous integro-differential systems. We study these abstract 
systems within a framework m odeled after the duality approach intr oduced by J.-L. Lions and 



his collaborators in the 1960's ( Lions 



1971 



Lions and Magenesl . 



1972j). A natural notion of weak 



solution is central to this approach, and complements that of strong (pointwise) solution. The 
class of first-order systems studied here possesses a remarkable property: smoothing solutions in 
time converts weak solutions to strong (pointwise in t) solutions. Specialized to systems such as 
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acoustics or elastodynamics, this property implies that appropriate traces of solutions are well- 
defined, provided that the right-hand-side or source term is minimally smooth in time (only!). 
Existence, uniqueness, and regularity of the solution as function of problem data - both right-hand 
side and (operator) coefficients - also follows from this fact. 

The abstract theory accomplishes even more than that, as it applies to a much wider range 
of dynamics than those commonly occurring in continuum mechanics, accommodating first-order 
systems with operator coefficients. This added generality creates no additional difficulty for the 
basic theory. It is in fact very useful: as will be explained in the Discussion section, it justifies 
certain infeasible-model methods for inverse problems in wave propagation. 

Another useful by-product of the theory is the finite speed of propagation property for hyperbolic 
systems with bounded, measurable coefficients, a result which so far as we can tell is new. This 
follows from the similar property for systems with smooth coefficients via the continuous dependence 
of solutions on the coefficients in the sense of convergence in measure. 

Precise statements of the main results for symmetric hyperbolic systems ([1]) appear in the 
next section, followed by a brief discussion of the related literature. The third section contains 
the definition of the class of abstract systems studied in this paper, and statements of the main 
results to be established concerning it. Proofs of these results follow in the next three sections. 
The seventh section shows how symmetric hyperbolic systems fit into the abstract framework, 
and contains proofs of the main theorems stated in the second section. The eighth section treats 
the hyperbolic case of linear viscoelasticity as an instance of the theory developed earlier. We 
end with a discussion of the potential importance of the general (operator coefficient) case of the 
abstract theory in formulating certain inversion algorithms, representation of energy sources, and 
several other matters not addressed in the body of the paper. Two appendices deal with the skew- 
adjointness of the acoustic grad-div operator, and the case of systems with no memory term (q = 
in (HI), for which initial value problems make sense. 



MAIN RESULTS FOR SYMMETRIC HYPERBOLIC SYSTEMS 

Our results hinge on an assumption concerning the skew-symmetric differential operator p(V), 
which to begin with may be viewed as a densely-defined operator on L 2 (Q,) k , whose domain is a 
subset of C°°(O mt ) fc that contains C^(Q mt ) k . We assume that p(V) extends to a skew-adjoint 
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operator operator P with dense domain V C L 2 (f2) fc , and that V is metrized with the graph norm 
of p(V) or its equivalent, which we denote by || • 

With this assumption, the first of our two major sets of results on symmetric hyperbolic systems 
addresses well-posedness. The conclusion with the most direct implications for the formulation of 
inverse problems is 

Theorem 1. Suppose that, in addition to the hypotheses explained above, the right-hand side f has 
a square-integrable t-derivative: f G JET^ (R, L 2 (Q) k ), and is causal, that is, f = 0, t < 0. Then 
there exists a unique causal u G C 1 (R, L 2 ($7) fc ) n C°(R, V) satisfying ([I]) a. e. in Q for each t G R. 
Moreover, there exists an increasing function C : R+ — > R + , depending on C* , C* , and bounds for 
b, q, so that for every t G R, 



du . . 



+ \H;t)\\ 2 v < C(t) / dr 

L 2 (n) k J o 



ll/(-.r)Hi a(n) » + 



du 



L 2 (n) k 



(4) 



The second main result addresses the existence of traces and their regular dependence on the 
coefficients. Suppose that r C !) is a smoothly embedded m-dimensional submanifold, m < d. 
Denote by Sp the trace map C^{Q int ) k -¥ L 2 (T) k . Suppose that I G N and m G C^{T,R lxk ). 
Define S[m) : C^{n int ) k -> L 2 {T) 1 by 

S[m]u = mSpti- 

Note that S[m] induces a map L 2 ([0, T],C^°(Q int ) k ) -»• L 2 ([0, T],L 2 (T) 1 ), for any T > 0. Abusing 
notation, call this induced map S[m] as well. 

Having parametrized the various possible trace maps, we define the data prediction, or forward, 
operator, Tf^ m , by 

Tf >m (a,b,q) = S[m]u 

in which u is the causal solution of (pQ) provided by Theorem [TJ A suitable domain for J-f iT n is 
M C L°°(^,R s fc y X m fc m ) x L°°(ft,R fcxfc ) x L 1 (R+,L oo (0,R^ m fc m )), defined by 



M(C*,C*, C B , Cq) = {(a, b, q) : C*I < a(x) < C**J for all x G SI, ||6|| Loo(niRfc x fc) < C Bj 



1 311 Li(H + ,z,» (0,11^0) 



< C Q ,q(t) =0for i < 0}, 
M = (J{M(C„C , *,C , b ,Cq) :0<C, < C* ,Cb,Cq G R+}. 
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Theorem 2. Suppose that (i) S[m] extends continuously to V, and (ii) s > 2 and f G H S (R, L 2 (£l) k ). 
Then for any T > 0, T Lm : M -> C°([0, T], L 2 (r) z ) is well-defined and of class C s ~ l . 

A stronger result concerning continuity per se, involving a weaker topology on the coefficient 
set M, is also important. Recall that a sequence of real-valued measureable functions {g m : m G 
N} C L°°(Q) converges to zero in measure iff for any e > 0, 

e m = |{x : |g m (x)| > e}| 

is a null sequence. 

Theorem 3. Suppose thatO < C* < C*,0 < C B ,C Q , {{a m ,b m ,q m ) : m G N} C M(C*, C*, C B , Cq), 
and (a, 6, g) G (C*, C*, Cb, Cq). Suppose further that 

i. a m —> a in measure, 

ii. b m —> b in measure, 

Hi. L \q m — q\ — > in measure 

Denote by u m the (strong) solution of §1$) provided by Theorem 1, with coefficients a m ,b m , and 
q m , likewise by u the solution with coefficients a,b, and q, with common causal right-hand side 
f G H{ oc (R, L 2 (n) k ). Then for any T G R+, 

lim \\u m - u||ioo(r t] ,L 2 (n)) = °- 

One way of generating sequences converging in measure is by mollification. We exploit this 
observation to use Theorem [3] in conjunction with a well-known fact about symmetric hyperbolic 
systems with smooth coefficients, to prove 

Theorem 4. Suppose that (a, b, q) G M, e > 0, and that r G R+ satisfies 

d 

ro(x) + S>i&>e (5) 

i=l 

for almost every x G SI and every £ G R d for which |£| = 1. Suppose further that w C f!,T > 0, 
and that f G iTA (R, f2) is causal and satisfies /(x, t) = if r|x — xo| + to — t > for every 
(x ,to) G w x [0,T]. Tften i/ie causal solution u G C 1 (R, L 2 (Q) k n C°(R, V) o/ flU wii/i coefficients 
a,b,q and right-hand side f vanishes in uj x [0, T]. 



G 



Example. Linear acoustics provides an important example of the framework just described. Acous- 
tic wave propagation does not include the memory effect modeled by the convolutional term in (fT]), 
but illustrates several other features of the class of problems studied in this paper. 

For this example, d = 3, and we suppose that the domain Q, C R 3 is smoothly bounded (the 
smoothness requirement can be relaxed to some extent, as discussed briefly in Appendix A). The 
momentum balance and constitutive laws of linear acoustics relate the excess pressure p(t, x) and 
velocity fluctuations v(i,x) = (t>i(x, t), t>2(x, i), W3(x, t)) T , x G R 3 , to mass density p(x), bulk 
modulus k(x), constitutive law defect g(i,x), and body force density f(t,x) by 



<9v 

1 dp 
k dt 



In this example, k = 4, and 



-Vp + f , 
-V-v + g. 



/ \ 

P 



(6) 



Au = diag [ — , p. 1 1. f> j //. '/ 



V2 



g L 2 (ny 



V U3 / 

defines a bounded self-adjoint positive-definite operator A G i3+ mm (L 2 (0) 4 ), provided that logp, log k G 
L°°(fi). 

The right-hand side of ([U]) involves the matrix partial differential operator 



p(V) 



9 9 
9xi &2 9x3 



\ 



9 
9xi 



9x 2 U 



V 9X3 









(7) 



This matrix operator defines a skew-adjoint operator with domain V = Hq(Q) x Hh (fl) C L 2 (f]) 4 . 
[-H^iy (O) consists of v G L 2 (fi) 3 for which the distribution V • v G L 2 (f2) 3 . For the reader's 
convenience, we present a proof that p(V) : V — > L 2 (Q) 4 is skew-adjoint, in Appendix A.] 

Finally, the source vector / is defined by 

f(t) = (g(t,.),i(t,-)f GL\n)\ 
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The acoustics system (|6|) is symmetric hyperbolic and satisfies the conditions of Theorem [TJ 

Note that if k and/or p are discontinuous, then the form of the equations (jGj) immediately 
implies that no solutions of class C 1 may exist, even if the right-hand side / (that is, the body 
force density f) is smooth. Physically reasonable fluid configurations thus exist for which solutions 
in the classical sense cannot be defined, for example piecewise homogeneous mixtures with jump 
discontinuities of density and/or bulk modulus across smooth interfaces. However, Theorem Q] 
provides a solution with physical sense, as the strain energy 



is well-defined for any such solution. 



p2 

p V . v _| 

K 



(;t) = -{ U (t),Au(t)) L 2 (n) 4 



Theorem □ gives u G C°(R,V), provided that / G fl£ c (R, L 2 (^) 4 ). To obtain differentiable 
dependence on the coefficients it is necessary to sacrifice one more time derivative. This sacrifice 
seems minor, as modeled energy sources (represented by the right-hand side in ([1])) are in many cases 
quite smooth in time, reflecting the finite bandwidth of energy generation and recording equipment 
and the loss of high frequencies to dissipative mechanisms during propagation. Membership in 
V = Hq(Q) x Hh (ft) implies that some (but not all!) traces of u are well-defined. In the notation 
of Theorem^ given a smoothly embedded hypersurface T, choose I = 1 and m = mp or m = mjv> 
where 

m D (x) = [1,0,0,0], m N (x) = [0, ni(x), n 2 (x), n 3 (x)], x £ T, 

and n = {ni,n2 : n%) T is a smooth normal field on T. Both 5[m£»] and S[m^] extend continuously 
to V. Note that traces of tangential components of v do not so extend. The end result is that 
both T^ mD (pressure data) or T^ mN (normal velocity data) are well-defined and differentiable as 
functions of the coefficients, of class C s ~ x if / G -fff oc (R, L 2 (Q) A ) (Theorem [2|) . The domain of 
either version of J- in this case is the open set M ac C L°°(f2) 2 , defined by nondimensionalizing 
choices of scale s K ,s p G R+ and 

M ac (C7*,CT) = {(k, P ) G L°°(ft) 2 : (s k k)~\ {s.p)- 1 < (C*)~\ s k k, s pP < C* a.e. in 0}; 

M ac = |J{M ac (C*,C*) : < C m < C*}. 

The content of Theorem |3] in this case is just what one would expect: waves move with speed at 
most 

IIvWpIIl-co)- 
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We end this section with a brief overview of prior work. The well-posedness of symmet- 



ric hyperbolic systems wit 



(jCourant and Hilbertl . 



i regu lar 



1962 



Lax 



smooth) coefficients has been well-understood for decades 



2006). Some effort has also been devoted to describing solutions 
for first- or second-order hyperbolic systems with less-than-smooth (but still continuous) coeffi- 
cients, mostly focuse d on the propaga t ion o f regular i ty along bicharacterist i cs, or the existence of 



traces (for example. (IBeals and Reed 



Lasiecka and Trigiannil . 



1989 



1982; 



Bao and Svmes 



Svmea . 



1991 



1983: 



Beals and Reedl . 



19931 : Smith . 19981 



1984 



Lasieckal . 



1986 



19981 ) . Those works treating the 



continuity or differentiability of the solution as a function of the coefficients have mostly required 



more smoothness of the coefficients than is a 
dime n siona l prob 



1996 



Salo 



2006 



e ms (1 B amb e rger et al 



1979 



Stefanof and Uhlmann 



lowed in the present work, or dealt only with one 



Svmes 



1986 



Lewis and Svmed. 



1991 



Bao and Svmes 



2009). An exception is (jFernandez-Berdaguer et al. 



19961 ). in which (results which imply) a special case of Theorem [2] is established. A large num- 



ber of articles have appeared o n abstract first- or second-order 
inverse problems, fo r example (ILavrentiev et al 



Ramm and Koshkin 



2001 



Awawdeh . 



2010; 



1986; 



Chou 



Orlovskv et al 



li, 



hyperbolic equations and related 



1991 







Lorenzi and Ramm . 



2001 



20101 ). mostly using the theory of 



strongly continuous semigroups (or cosine operators, in the second order ca se) to obtain a hold o n 



well-posedness. Of these, the closest in spirit to the present work is that of 



Orlovskv et al 



mm), 



which treats second order systems via the cosine operator approach but observes that smoothness 



in time of data imp 
result in our work. 



ies that weak solutions are strong solutions, analogous to a crucial intermediate 



Choullil (|199ll ) established that a first-order abstract integro-differential initial 



value problem, resembling those of our abstract framework to be detailed in the next section, is 
well-posed, and also proved uniqueness for the solution of a related inverse problem. The form of 
the integral term is actually a generalization of our convolutional memory operator, and the require- 
ment on the "spatial" operator is also a generalization (generates a strongly continuous semigroup 
- in the problem considered here, this operator is skew-adjoint, and generates a unitary group). 
However Choulli formulates an inverse problem which involves only determining lower-order terms, 
rather than the principal part, the inverse problem data does not involve time-like traces, and 
Choulli does not investigate the regularity of any analogue of T . 

The imm ediate predecessor of our work is Chapter 2 of the second-named author's PhD thesis 



dStolk . 



2000), which treated second-order hyperbolic systems with nonsmooth coefficients, includ- 



ing the displacement formu 



and his collaborators (Lions 



ation of elastodynamics, b e ginnin g with the techniques of J.-L. Lions 



1971 



Lions and Magenesl . 



1972j) but going beyond well-posedness to 
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study the dependence of solutions on coefficients. The results on strong convergence and Gateaux 
differentiability (Theorems [7] and [8]) are direct translations of arguments from this source, as is 
the observation that convergence in measure of L°° multipliers implies strong convergence of the 
corresponding multiplication operators on I? (Lemma 

A CLASS OF ABSTRACT FIRST ORDER EVOLUTION EQUATIONS AND 

THEIR PROPERTIES 

Let H be a separable real Hilbert space, with inner product (•, •) and norm || • ||. We will denote by 
V C H a dense subspace, itself a separable Hilbert space with inner product (•, -)y and norm || • \\y, 
defining a stronger topology on V than that induced by H. B(H) is the Banach space of bounded 
linear operators on H, with the operator (uniform) norm. B symm (H) is the subspace of bounded 
self-adjoint operators, and B^ mm (H) is the cone of bounded self-adjoint coercive operators. We 
suppose that 

• A € B+ mm (H), < C* < C* so that CJ < A < C*I. 

• Be B(H); 

• Q G L 1 (R,S symm (iI)) n C°(R+,B(H)), and is causal, that is, Q(t) = for t < 0; 

• P is a skew-adjoint operator with domain V, for which the graph norm is equivalent to || • ||y. 

The kernel Q defines a continuous linear operator R : L^_(R, H) — > L^_(R, ff) on the space 
L 2 + (K,H) = {v G Lf oc (R,#) : for each T G R, u G L 2 ((-oo, T], H)} 
with the natural countably normed topology, by 

R[u](t) = I Q(t- s)u(s)ds. (8) 



R is well-defined since Q is causal. Note that if u G L^ oc (R, H) is causal, that is, suppii C [T, oo) 



J loc^ 

-2 



for some T G R, then u G L^_(R, i?[n] is causal also and suppi?[n] C [T, oo). The formal 
(distribution) adjoint 

R*[u](t) = I Q{s-t)u(s)ds 



satisfies a similar condition: if supp-u C (— oo,T] for some T G R, then suppi?*[n] C (— oo,T]. 
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The components described above combine to yield the formal evolution problem: find an H- 
valued function of t, say u, which solves, in a suitable sense, 



Au + Pu + Bu + R[u\ = f, 
in which the right-hand side / is also an H- valued function of t. 



(9) 



We follow ([Lions, 



1971 



Lions and Magenesl . 



1972 



Stolid, 



2000) in defining weak solutions u G 



Lf oc (R, H) of the formal evolution problem ([9]) by integration against smooth test functions. Inspec- 
tion of ([9]) suggests that a similar constraint must also be placed on the right-hand side: henceforth, 
we assume / G L^ oc (R, H). Because the operator kernel Q may have unbounded support, we must 
constrain the growth of candidate members of Lf (R,H) on the negative half-axis. Accordingly, a 
weak solution of the formal evolution problem ([9]) is a member of u G Lj 2 (R, H) satisfying 

1. nGL^(R,if); 
2. 



J (u(t), {Atf + - B*<p - R*[<t>))(t)) dt = - J (f(t), <f>(t)) dt- 
for all 4> G Cg°(R,V). 



(10) 



Note that since R*[4>] is supported in the half-axis (— oo,supsupp^>], and is square-integrable, 
assumption 1. implies that the last term on the left-hand side of (|10p is well-defined. 



We will reserve the term strong solution to designate a function u G C 1 (R, H) n C°(R, V) n 
Li(R, H) which solves Q pointwise. Clearly a strong solution is a weak solution. Existence of a 
strong solution in this sense also implies that / G C (R, i?). 

Because of the causal assumption on the convolution kernel Q, existence of a causal weak 
solution vanishing for t < T$ implies that the right-hand side / must also be causal, in fact vanish 
for t < T Q . 

Concerning the fundamental questions of uniqueness and existence, we prove 

Theorem 5. Assume that H, V , A, B, P, and Q have the properties listed above, and that 
f G Lf oc (R, H) is causal. Choose Tq so that supp/ C po,oo). Then there exists a unique causal 
weak solution u G C (R, H) of |P|) with suppu C [To,oo). For every T > To, there exists Ct ,t > 
depending on Tq, T, C* ; C* , an d \\Q\\l 1 (r,b(H))> so that for any t<T, 

\Ht)\\l < Ct ,t f ds\\f(s)f. (11) 

J — oo 
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Regularity in time of the right-hand side implies that the weak solution is strong: 

Theorem 6. In addition to the assumptions of Theorem^ suppose that f G L%(R, H)C\Hf oc (R, : H), 
k > 1. Then the causal weak solution u of (|10|) satisfies u G C k (R,H) n (7 1 (R, V). For every 
T > To, there exists Ct ,t > depending on To, T, C*, C* , \\B\\q^, and WQW^CRJSfH))} so that 
for any t <T, 

||n^(t)|| 2 v <C TOiT ^ / ds\\f^(s)f, j = Q,...k-l. (12) 

In particular, u G C 1 (R, if) n C°(R, V) is a strong solution of ([9]). For each j = 0, k, is 
i/ie weaA; solution of ([9|) wii/i right-hand side f^\ 

Regularity of weak solutions u as functions of the coefficients A,B,Q is expressed through 
estimates which are uniform over certain sets of problems (that is, certain sets of coefficients), 
which are important enough to merit a definition: given < C* < C*,0 < (7b, < Cn, define 

V(C*,C*,C B ,C Q ) = {(A,B,Q) G B+ yTDm (H) x B x [L l (R,B syima (H)) nC°(R + ,B(H))] : 

C*J < A < C*J, ||-B||b(^) < C B , ||<9||ii(R,if) + \\Q\\l°°(r,h) < Cq}; 

7? = U{V(C„ C*, C B , C Q ) : < a < C*, < C B , < C Q }. (13) 

Strong convergence of the coefficient operators, in an appropriate sense, leads to uniform con- 
vergence of the corresponding weak solutions on compact sets in R. For simplicity, and because all 
of the examples we have in mind satisfy this restriction, we assume that all of the systems appearing 
the the formulation of this continuity result share the same "spatial" operator P. It is possible to 
weaken this assumption, that is, to approximate P as well; we leave the formulation and proof of 
this (slightly) stronger result to the reader. 

Theorem 7. Suppose that H,V are as described, P : V — > H is skew-adjoint, < C* < C*,0 < 
Cb, < C Q , {(A m , B m , Q m ) : m G N} C V(C*, C*, C B , C Q ), (A, B, Q) G 7>(C„ C*, C B , C Q ), and 

1. lim m ^oo || (Am — A)w\\ — > for all w G H; 

2. lim-m^oo \\(B m - B)w\\ — > for all w G H; 

4- the convolution operators R m , R with kernels Q m , Q satisfy lmim^oo ||i? m [u;]— ||l 2 ((-oo,T]) — > 
for any T G R, all w G L 2 + (R,H). 
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Let u m , respectively u, be causal weak solutions of the differential equation (|9j) with coefficients 



(A m , P, B m ,Q m ), respectively (A, P, B,Q). and (common) causal right-hand side f E Lj* (R, H). 



Then for any choice of Tq < T E R, 

lim \\u m — u \\l°°(\t ,T\,H) = 

With additional regularity of the right-hand side, solutions of ([9]) have directional (Gateaux) 
derivatives as functions of the coefficient operators: 

Theorem 8. Suppose that H,V are as described, P :V —> H is skew-adjoint, (A,B,Q) E V, and 
f E i^ 1 (R, H) is causal. Denote by u E C 1 (R, H) n C°(R, V) the causal strong solution of with 
these choices of coefficients and right-hand side. Assume that SA,5B E 13(H), 5 A is self-adjoint, 
and 5Q E L 1 (R, B symm {H)) f) C°(R+, B{H)), 5Q = for t < 0. Define for h E R 

A h = A + h5A, B h = B + h5B, Q h = Q + MQ. 

For sufficiently small h, {ABh,Qh) E "P, so that the problem ([9]) coefficients Ah, P, Bh,Qh 
and right-hand side f has a unique (strong) solution Uh E C 1 (R, H) n C°(R, y). Denote by 
6u E C°(R, -ff) i/ie weafc solution of the formal evolution problem 

Abu + P5u + B5u + = -Mti' - 5Bu - SR[u] , (14) 

in which R (5R) is the convolution operator with kernel Q (SQ), as usual. Then for any choice of 
T <T E R, 



lim 

h->0 



u h -u 

- ou 



h 



= 0. (15) 

L°°([T ,T\,H) 



With the correct choice of topology, the solution is (Frechet) differentiable as a function of the 
coefficients, assuming as in Theorem [8] that the right-hand side is somewhat regular in t. To express 
this fact in a form most useful for application to inverse problems, introduce another Hilbert space 
W (of "measurements"), and a "sampling" map S : V — > W, assumed continuous, Of course S 
induces a continuous map : C k (R, V) — > C k (R, W) for any k E N; we abuse notation by writing S 
for this induced map also. 

For causal / E #/ oc (R, H), define Tf.T^ C X (R, W) by 

T f [A,B,Q]=Su, (16) 

in which u E C 1 (R, H) n C°(R, V) is the causal (strong) solution of ([9]) with coefficients A,P,B,Q 
and right-hand side /. It follows from Theorem [6] that J-f is well-defined. 
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Theorem 9. Suppose that f G Hh (R, H) is causal, k > 2, and view V as an open subset of the 
Banach space M = B symm (H) x B(H) x [L 1 (R, B symm (H)) n C°(R+, B(H))], with norm 

(A, B, Q) h-> \\(A,B,Q)\\ M = \\A\\b(H) + \\ b \\b(H) + m ax(||Q|| L i( R+jB (H)), 
Then for any T < T x G R, J> G C*" 1 ^, C°([T , Tx], W)). 

Remark: Note that this theorem expresses "differentiability with a loss of one derivative" , in two 
senses. First, taking k = 2, even though / G C (R, H), the map J 7 is only once differentiable. 
Second, even though T a priori takes values in C 1 (R, W), it is of class C 1 only as a map with 
range C (R, W). In these respects Theorem [9] is sharp - counterexamples can be adduced to show 
that neither of these losses-of-derivative are artifacts of the proof. 



THE ENERGY INEQUALITY 

Define the energy E(t) of a weak solution u of by 

E{t) = \(u{t),Au{t)) (17) 

It follows from the definition of weak solution that E is well-defined almost everywhere, and locally 
integrable. Because A is positive-definite, 

C4u(t)f < E(t) < C*Ht)|| 2 (18) 

hold for almost all t G R, for suitable C* > C* > 0. 

Remark. In the linear acoustics example presented in the previous section, E(t) is precisely the 
mechanical energy of the acoustic field at time t. 

In this and the following sections, we will use C to denote a generic nonnegative constant 
depending on C*,C*, , and ||Q||Li(RB(fn), and possibly on other quantities as noted. 

We shall repeatedly use following property of weak solutions: 

Proposition 1. Suppose that u G Lf oc (R, H) is a weak solution of ([9]). Then for any r\ G Cq°(R) ; 
i] * u is a strong solution, and in fact 

rj*u e C°°(R, V). (19) 
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Remark: In applications such as acoustics, or viscoelasticity to be discussed later, H and V 
are function spaces, and membership in V entails additional regularity beyond that required for 
membership in H. In such applications, the content of this theorem is that smoothing in time also 
"smooths in space" , in the sense that the values of the smoothed weak solution are confined to the 
subspace V. 

Proof. Choose the test function (j> in (|10p to have the special form <p(s) = rj(t — s)w, where t £ R, 
w 6 V, and rj e Cg°(R). Then 



(rj * u(t), Pw) = I J rj{t — s)u{s) ds, Pw^j 
(u(s), P(rj(t — s)w)) ds 



(u(s), Arf(t — s)w + B*rj(t — s)w 

+R*[rj(t- -)w](s))] ds- (77* f(t),w) 
(A{rf *u)(t) + B(r]*u)(t) + R[rj * u](t) , w) , (20) 



where the third equality is simply a rearrangement of (|10p with the special choice of test function 
4>(s) = rj(t — s)w mentioned above, and the last holds amongst other reasons because R is also 
convolutional. The right-hand side of (f20|) is bounded by a ^-independent multiple of 
therefore so is the left. Therefore rj * u takes values in the domain D(P*) of the adjoint P* for 
any 77 S Cg°(R). But P is skew-adjoint, so T>(P*) = T>(P) = V. Therefore we can shift P to the 
left-hand side of the inner product on left-hand side of (|20p . to obtain 

{-P(i]*u)(t),w) = (A{rf * u)(t) + B(rj * u)(t) + R[q * u}(t) - (rj * f)(t),w) . (21) 

As dHJ holds for every w £ V and V C H is dense, it follows that 

- P(r] * u)(t) = A(rj' * u)(t) + B(rj * u)(t) + R[q * u](t) - (rj * f)(t), (22) 

that is, © is satisfied pointwise in t. 

Since the right-hand side of (|22p is of class C 0C (R, H), so is the left-hand side. Denote by 
T$ t G B(Lf oc (R, H)) the translation operator by 5t, defined by Tg t u(t) = u(t + St). For each k G N, 
let 



j=-m k 
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be a finite difference operator for which u G C°(R,H) has k derivatives if and only if A^stu 
converges pointwise in t for each i = 1, ...,k to tjW as St — > 0. From (|22l) it follows that for each 

- A Mt (P(77*n))(t) = -P(A fc ,«(»7*u)(t)) 

= A M i(A(r ? '*n) + J B(7?*n) + J R[r/*n]-7?*/)(t). (23) 

Since the right-hand side of (1231) converges for each t, so does the left-hand side, both versions, as 
5t — > 0. Thus A*. 54(77 * converges in the graph norm of P, hence in the sense of || • ||y, as 
<ft -> for each t G R, fc G N. Thus ij*ue C°°(R, V). □ 

Proposition 2. Let u G Tj (R, i?) 6e a weaA; solution of ([9]) ; i£ G (R) its energy as defined 
in (|17p. T/ien 

• a/ter modification on a set of measure zero, E is continuous; 

• if in addition f is causal, supp f C [To, 00), then for any T G R there exists Ct ,t > so 
that for t G (—00, T], 

E(t) < C To , T f ll/H 2 . (24) 

J — a 



-oo 



Proof. Let r) n G C£°(R) be an approximate identity, that is, r) n (t) = nn(nt), where 

77 G C °°(R), 77 > 0, f q(t) dt = 1, supp 77 C [-1, 1]. (25) 

Define 

En(f) = \{{Vn * u)(t), A( Vn * u)(t)). (26) 

bmce 77„ * u — > u m Lf oc (R,iT), E n -> E in L\ oc (Yi). Thanks to Proposition Q] r] n *u£ C°°(R,V) 
is a strong solution of © for each n G N. 

For each n, £7 n is smooth; differentiating E n , obtain for any s,t G R 

~!T {T)dT 

t 

({Vn * u)(t), MVn * u)'(t)) dr 

t 

[~{(Vn * «)(r), (P(t7„ * u)(t) + P(r? n * n)(r) + R[r] n * n])(r)) 

+ (0m*/)(t),0m*u)(t))] 
t 

[-((% * «)( r )) * «)(t) + P[r/ n * «])(r)) 

+ ((%*/)(r),(^*«)(r))] dr. (27) 
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thanks to the skew-symmetry of P. 

Since convolution with 77 commutes with the convolution operator R, and with the actions of 
the other operators appearing in Q, the identity (|2"7|) implies that 



\E n (t) - E n {s) 



dE. 



dr 



~{t) dT 



\{r] n * (Bw(-))(r),7^*«(r))| 
+ \((7 ln *R[u])(r),r ]n *u(T))\ 
+ \(Vn * /(t),% * u{t)}\ dr 

< (||S|| + l)(j['||(%*tt)(r)|| a (lr) 

+ [\\\(Vn*R[u])(T)\\ 2 + \\(r, n *f)(T)\\ 2 )dT 



(28) 



Since u, R[u], and / are locally square-integrable, for each t € R and e > 0, there exist At(t,e) > 
and an N(t,e) S N so that for \s — t\ < At(t,e) and n > N(t,e), 

ft+l/n 



/t+l/n rt+l/n 
\\R[u}\\ 2 < e, and / 
_ — 1/n J s—l/n 



< e, 



whence (f28|) implies that for n > N(t,e), \s — t\ < At(t,e), 



\E n {t) - E n {s)\ <Ce. 



(29) 



Continuity of E n implies existence of At(t, e) > so that for \s — 1\ < At(t, e) and n < N, \E n (t) — 
E n (s)\ < Ce. Thus the inequality (|29p holds for all n S N if s satisfies \s—t\ < min(At(t, e), At(t, e)). 
Since t £ R, e > are arbitrary, we have shown that the sequence {E n } C C (R) is equicontinuous. 



Choose To < T G R: it follows from the definition (|26|) and Young's inequality that 

fT f-T+l 

I E n <C \\u\\ 2 

JTq Jt -1 



is bounded independently of n. For t G [To,T], 

r-T 



(T-T )E n (t)= I (E n {t)-E n {s))ds+ [ E n (s) 

JTn JTn 



ds 



,-T 

Apply Young's inequality to the convolutions with rj n appearing in the inequality 
that for To < s, t < T, 

rT+l 
'To- 
ll 



\E n (t) - E n (s)\ < C [ T+ \\\u\\ 2 + ||i?M|| 2 + 

JTo-l 



(30) 
(31) 

to conclude 
(32) 



Taken together, ([30]) . (f3Tj) and ([32]) imply that {E n } is a bounded subset of C°([To,T]). According 
to Ascoli's theorem, {E n } is precompact in C°([To, T]), hence a subsequence converges uniformly to 
a continuous limit. Since the subsequence is necessarily also /^-convergent, and T G R is arbitrary, 
the first assertion of the theorem is established. 

In view of the continuity of E, we may take the limit n — > oo on both sides of the inequality 
(|28p along the uniformly convergent subsequence whose existence we have just established. Since 
r] n * u — > u in L 2 ([To, T], H), the right hand side converges, and we obtain 

\E(t) - E(s)\ < C !\\\uf + \\R[u]f + Il/H 2 ). (33) 



We have assumed u to be causal, but this assumption has not appeared in the reasoning up to 
now. It allows us to take s — > — oo in ()33p . In view of the equivalence of \[E and the norm || • || 
(inequalities ([18]) ). the inequality (|33|) implies that 



E(t) <C I {E + 

J — oo 



Gronwall's inequality then yields the second conclusion. □ 
Corollary 1. The energy E of a weak solution u of Q, as defined above, satisfies for any s,t G R 
E(t)-E(s)= ! \-Bu{t) - R[u]{t) + f(T),u(r)}dr. (34) 

J s 

Proof. Continuity of E and convergence of i] n * u to u in L 2 oc (R, H) allows us to take limits on 
both sides of (EH). □ 



Corollary 2. Suppose that u\,U2 £ L 2 oc (R, H) are causal weak solutions of |P)). Then u\ = U2- 

Proof. The conclusion follows immediately from the energy inequality (|24|) , applied to the difference 
u = u\ — U2, which is a weak solution with / = 0. □ 

Corollary 3. Suppose that u G L 2 oc (R, H) is a causal weak solution of (0). Then u G C°(R, H). 

Proof. For 5t G R, denote by ust the member of L 2 oc (R, H) defined by ust{t) = u[t + 5t). Then 
ust is a causal weak solution (the only one, thanks to Corollary [2]) of ([9]) with / replaced by 
f st G L 2 (R,H), defined by f 5t (t) = f(t + St). The translation group acts strongly continuously on 
L 2 , i.e. \\fst — /||l 2 (r,h) — >• as (5f — ^ 0. Since the difference ugt — u is a causal solution of Q with 
right-hand side fst — f, it follows immediately from (j24"j) that ||w<5((i) — u(i)||# — > as 5t — > for 
any t G R, that is, u G C°(R). □ 
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Corollary 4. Suppose that 

1. K C B(H) is a bounded set; 

2. C C B(V,H) is a bounded set of skew- adjoint operators on H with (common) domain V, 
whose graph norms are all equivalent (to each other and to the norm in V); 

3. Ai C B(H) is a bounded set of self-adjoint, uniformly positive definite operators: there exist 
constants < C* < C* so that for all A 6 At, 

CJ < A < C*I; 

4- Qc ^(R, B syram (H)) P| C°(R+, B(H)) is a bounded set of causal operator-valued functions: 
ifQeQ, then Q(t) = 0fort<0. 

Let the set VcAixCxfCxQ parametrize a family of formal evolution problems of for ([9]) ; 
with coefficients A £ Ai,P G C,B £ K., and Q G Q, with common right-hand side f £ L 2 (R,H), 
and let U C L^ oc (R, H) be a corresponding family of causal weak solutions. Then U C C°(R, H) is 
equicontinuous. 

Proof. That WcC° (R, H ) is the content of the last Corollary. It follows from the proof of the basic 
energy estimate (|24p that the constant C appearing in its right-hand side may be chosen uniform 
over V - indeed, the bounds defining the sets listed in conditions 1-4 above are precisely those on 
which our constants, canonically notated C, depend. Therefore (|24p implies that for u £U, 

i r-t ft+St 

\\u(t + 6t)-u(t)\\ 2 <—E UH „ u (t)<C / \\f 5t -ff = C Il/H 2 
from which a uniform modulus of continuity follows. □ 

Additional regularity in time of the right-hand side / translates into additional regularity of 
the solution. 

Corollary 5. Suppose that f £ L^ Q (R,H) n Hf oc (R,H), k £ N, and u £ Lf oc (R, H) is a weak 
solution of Then u £ C k (R,H); moreover, for j = l,...k, is the weak solution of (fTUj) 
with right-hand side f^>. 
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Proof. Corollary [3] is the case k = 0. 

The case k = 1 provides the induction step, so we discuss it first. Since / G Hy oc (R, H), using 
the notation of the proof of Corollary O 

j t Ust-f)^f as5^0 

in mean square. The energy estimate (|24p implies that for each t, 

Y t (u&t{t) - u(t)) 

has a limit as 5t — > 0, whence u is differentiable and u' satisfies (|10p with right-hand-side /'. The 
previous Corollary shows that u' is continuous. 

This argument also serves as the induction step to establish the assertion of the Corollary for 
k > 1. □ 

Smoothing the solution in time leads to a different sort of "regularity" : 

Corollary 6. Suppose that T G R, / G L^ o (R,H), AT > 0, rj G C§° ((-AT, AT)), and /ei u 
denote a causal weak solution of (|10p . T/ien /or every T > To, i/iere exists Ct ,t,at > depending 
onT ,T, AT, ||r/|| L i (R); HVIIlMR)' C *> \\ b \\b(h), and \\Q\\ l ^r,b(h)), so that for any t < T, 

/T+AT 
ds\\f(s)\\ 2 . (35) 

Remark: Note that this inequality is a pointwise bound on r/*u in the norm of V. In applications, 
V is compactly embedded in H , so this is potentially a much stronger statement than the obvious 
//-norm bound which follows directly from (|24p . 

Proof. The left hand side makes sense, of course, thanks to Proposition [TJ The identity (|20p may 
be re-written as 

-(P(r,*u)(t),w) = J {u(s),-Ar)'(t - s)w + B*r,(t- s)w + R*[r)(t - -)w](s)) ds - (n * f(t),w) 
for any w G H (not just V\). Since supp(ry) C [—AT, AT], for any t < T„ 

fT+AT 

\(P(r,*u)(t),w)\ < C\\w\\ ds(\\u(s)\\ + \\f(s)\\) (36) 



r T+AT \ \ 

< C ToiTAT \\w\\ I / da||/(s)|| 2 (37) 
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thanks to the the energy estimate (|24p and by-now familiar use of Young's inequality and the 
assumptions on the various operators and quantities in the problem formulation. Now choose 
w = P(r]*u)(t) to obtain a bound on its //-norm. In combination with (|24p and Young's inequality, 
this estimate implies a bound of the required form on (77 * u)(t) in the graph norm of P, that is, in 
the norm of V. □ 



EXISTENCE OF WEAK SOLUTIONS: PROOFS OF THEOREMS ?? AND ?? 



The proof of existence follows the pattern laid out by iLions! (|197ll ). which in turn echos Cauchy's 
proof of the fundamental theorem of ordinary differential equations. We define a Galerkin method, 
show that it converges, and finally that the limit is a weak solution. Note that no rate of convergence 
follows from this argument; in fact it is easy to see that none can be expected. Of course, Proposition 
[2] has already assured that the solution so constructed is the only solution. 

Proof, of Theorem [5j 

In view of the energy estimate (Proposition [2J), at most one such solution exists, and any 
sequence of weak solutions, corresponding to an L 2 (R, -£T)-convergent sequence of right hand sides 
/, must itself be L 2 (R, H) -convergent. Therefore it suffices to establish existence of solutions for a 
L 2 (R, H)-der\SQ set of right hand sides. In particular, we may assume that / G C°(R,H), without 
loss of generality. 

Since V is separable (with respect to the graph norm of P) and V C H is dense, countable 
linearly independent subsets {w k } k x ^ =1 C V exist for which finite linear combinations are dense in 
V, hence in H. Without loss of generality, assume that {w k }'^L 1 is (H-) orthonormal: (w k ,Wi) = 

Define m x m matrices A m (symmetric positive definite), P m and B m by 

AS = (Aw k , Wl ), (38) 
P% = (Pw k , Wl ), (39) 
Bt, = (Bw k , Wl ), (40) 

for 1 < k, I < m, and the operator R m on L 2 oc (R) m defined analogously to (|SJ) by 

R m U m (t)= I (Q m (t-s)U m (s)ds, Q k n l (t))ds = (Q(t)w k ,wi), l<k,l<m. 
J —00 
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Note that Q m G L X (R, £ symm (R m )) f] C°(R+,£(R m )) is causal {Q m {t) =0,t< 0). 
Define F m G C°(R) m by 

F?(t) = {f(t),w k ), l<k<m. 



A minor modification of a standard contraction mapping argument (see for example ICoddington and Levinson 



(j 19551 )) shows that for each m G N, the initial value problem 



dU m 

j\m I pmjjm _|_ j^mjjm _j_ J^ m JJ m 

f/ m (t) = 0, f < T . (41) 

has a unique solution [7 m G C 1 (R, R m ). 

For each m G N, define u m G C^R, V),f m G C°(R, £T) by 

fc=i fc=i 

Then the system (|4ip satisfied by U m , together with the if-orthonormality of {wk}, implies that u m 
is the weak solution of the evolution equation ^ with right-hand side f m . The energy estimate (|24p 
shows that the sequence u m is bounded in Lf oc (R, H) , hence by the Tychonoff- Alaoglu theorem 
and a diagonal process argument weakly precompact in Lf oc (R,H). Denote by u m n) a weakly 
convergent subsequence, and by u its weak limit. Since u m (t) = for t < Tq and all m G N, the 
same is true for u. 

To see that the limit u is a weak solution of ©, introduce for each thq G N test functions ip of 
the form 

m 

1> = Yl fa ® ^' ^ G C o°°(R)- (42) 
fc=l 

For / sufficiently large that m(Z) > m , (f m (t),?p(t)} = (f(t),ip(t)), (Pu m ^,ip) = -(u m ^, Pip), 
etc. So 

Letting I — > oo, it follows that u satisfies ()10p for all test functions ^ of the form given in equation 
(|42p . Since linear combinations of w m 's are dense in V, the set of functions of the form (|42p is 
dense in Cq(R, V), whence u is a weak solution of Q. □ 

Proof, of Theorem [6j 

We give the proof for k = 1; the general case follows by a straightforward induction argument. 
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According to Choose a Dirac sequence {ij n : n G N} C Cg°(R) as in the proof of Proposition 
[2j According to Corollary [6J rj n * u G C fc (R, V) for each n G N. According to Corollary 
u G C 1 (R, i/), so the first term on the right-hand side of ()20p may be integrated by parts to yield 

({r] n *u),Pw} = T]*(u,Pw) (43) 
= % * (Av! + Bu + R[u] - /, w) (44) 

for any w G V. Both sides are continuous in i, so the limit as n — > oo is valid pointwise. Thus 

(u{t),Pw) = (Au'{t) + Bu(t) + R[u}(t)- f(t),w) (45) 

for any w G V, t G R. Pointwise bounds on ||ti(t)|| (Proposition [2]), (Corollary [5]) and the 

standing assumptions on the various operators imply that 

(u(t),Pw) < C To , T \\w\\ (/ Jll/ll 2 + ll/'ll] 2 ) 2 

for all w G V, t G (— oo, T] (and any T, but the constant Ct ,t depends on To and T), which shows 
that u(t) G P(P*) = V(P) = V for each t G R. Thus 

(Pu(t),w)<C To ,T\H\ (/ Jll/ll' + H/'ll] 2 ) 2 

for all u; G -ff, t G (— oo,T]; taking u; = Pu(t), obtain the pointwise bound (|12p for fc = 1. 
That u G C°(R, V) follows exactly as in the proof of Corollary O via the strong continuity of the 
translation group on i7 1 (R, H). □ 

CONTINUOUS DEPENDENCE ON PARAMETERS: PROOFS OF 

THEOREMS ??, ??, AND ?? 

The proof of Theorem [7] divides into three steps. In the first step (Lemma [T]), we show that the 
assumptions of the theorem imply Lf oc (R, H)-weak convergence of the sequence of solutions. The 
second step leverages this result to show that the sequence converges H-weak\y, pointwise (Lemma 
[5]). The last step combines these two observation with strong convergence of the coefficients to 
obtain convergence in norm of the solution sequence. 

Lemma 1. Under the conditions of Theroem^ u m converges weakly to u in Lf oc (R, H). 
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Remark. In order that R m converge to R pointwise, as assumed in the statement of the preceding 
theorem, it is sufficient that Q m — > Q uniformly in R + . 

Proof. The bounds implied by the energy estimate (Proposition [2]) are uniform over bounded sets of 
coefficients as described in the statement of the theorem. Therefore {u m } is bounded in (R, H), 
hence has an L^ oc (R, H )-weakly convergent subsequence u m m , with limit u G Lf oc (R, H) . Note 
that l| oc (R, #)-weak convergence implies convergence in the sense of H- valued distributions on R. 
Choose a test function G C^°(R, V): then 



/« 



(/(*), <f>(s)) ds = I (u m(l) (s), (An(^' + Pm®<i> - B*m{l)4> - «m«M)( s )) ds ( 46 ) 

(u(s), (A<j>' + P<j>- B*<f> - R*[<p])(s)} ds (47) 

+ l\(u m{l) { S )-u{s)),{A^' + P ( f ) -B*^-R*[<p]){s))ds (48) 

(u m(l) (s), ((A m(l) - A)<t>' + (P m(0 - P)<j> (49) 

-{B m{l) - B)*^ - (R m(l) - R)*[(f)])(s)\ ds (50) 



The second term vanishes in the limit I —> oo because of the weak convergence of u m ^ to u. The 
coefficients A m , ... range over bounded sets of operators, so we may replace 4> and 4>' in the third 
term with simple V^-valued functions, taking finitely many values, at the price of an arbitrarily 
small perturbation in this term, uniformly in /. However the strong convergence of the coefficient 
operators assumed in the statement of the theorem then implies that the resulting integrals become 
arbitrarily small as / — > oo. Thus u is a weak solution of the problem Q, and must therefore be 
the same as the (unique) weak solution u constructed in the preceding section. Thus no other 
weak accumulation point of the bounded sequence {u m } may exist, hence u m — 1 u in L^ oc (R, H) 
as claimed. □ 

Lemma 2. Under the conditions of Theorem^ u m converges to u weakly, pointwise int € R and 
uniformly on compact sets. That is, u m {t) — u(t) for all t £ R, and for any w G H , Tq < T € R, 

lim \(u m - u,w)\ L °on Tl b T n = 0. 

Proof. According to Corollary [4J the conditions described in the statement of Theorem [7] imply 
that {u m : m G N} is equicontinuous, hence uniformly equicontinuous on compact sets. Choose 
T < T £ R. Given e > 0, choose At > so that if \5t\ < At, t £ [T , T], then 

\\u m (t + Si) - u m (t)\\ < e, m G N; \\u(t + 5t) - u{t)\\ < e, 
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which implies that 

u m (t) 



1 



t+At 



Ur, 



2At J t _ At 

However, according to Lemma [U for any w G H 
1 



< e, m G N; 







rt+At 




)- 1 






; 2At 


/ U 

h-At 



< e. 



t+At 



1 



(u m -u,w) = / (u m -u, w——l [t _ Attt+At ] ) -> 0, m -> oo 



2At ,/ t _ At J \ 2 At' 

Therefore, assuming without loss of generality that [|«;[| = 1, 

\{u m (t) - u(t),w)\ < 3e 

for m sufficiently large, t G [Tq,T]. Since Tq,T, and e > are arbitrary, the proof is complete. □ 

We require one more fact about the memory operators R: 

Lemma 3. Suppose that {Q m : m G N} C ^ 1 (R, B symm (H)) n C°(R + , B(H)) defines a sequence of 
operators R m : L+(R,H) — > I/L(R, H) converging strongly to zero. Let K, C Li.(R, H) be bounded, 
and assume that all u G /C are causal, with common support suppu C po,oo). Then for any 
sequence {u m : m G N} C K., and any v G Lf QC (H, H), any T G R, 



dt(v(t),R m [u m ](t)) ->■ 0, m -> oo 



Proof. 



T f t 

dt I ds (v(t),R m [u m ](t)) 



T f t 

/ ds (v(t),Q m (t - s)u m (s)} 

-oo JTq 
-To ra 

da dr (u m (-a),Q m (a - t)v(-t)) 

-T J-T 
dt (M-T-T ]{t)u m {t),R m [l[-T-T ]v](t)). 



Note that we have used the symmetry of Q m , and it is for this reason that the assumption was intro- 
duced. According to the hypothesis, ||-R»nM ||L 2 ((-oo,t]) -> as m -> oo for any w G L + (R, H),t G 
R. Setting t = —To, w = l[_ Ti _7yu. Since K. G L^ ans (R, H) is bounded, the sequence 

{\\l[-T-T ](t)u m \\ L 2(n,H)} C R 
is bounded, whence the conclusion follows via the Cauchy-Schwarz inequality. □ 
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Proof, (of Theorem [7]) We will use repeatedly the algebraic identity: for K,L £ B(H),v,w £ H, 

(v, Kv) — (w, Lw) = (v — w, K{v — w)) + {v — w, Lw) 

+(w, L(v -w) + (v, (K - L)w) + (w, (K - L)(v - w)). 
If K and L are symmetric, this identity simplifies to 

(v, Kv) — {w, Lw) = (v — w, K(v — w)) + 2(v — w, Lw) + (2v — w, (K — L)w) 

Application of the second version of this identity with v = u m ,w = u,K = A m , L = A yields 

(u m ,A m u m ) - (u,Au) = (u m - u,A m (u m - u)) 

+(2u m - u, (A m - A)u) + 2(u m - u, Au). (51) 

The right-hand side / in the formal evolution equation @ for both u m and u vanishes for sufficiently 
large negative t, else u could not be causal, but then u m and u must vanish on a common (negative) 
half-axis, thanks to Corollary [21 The energy identity (|34p implies that 

(u m ,A m u m ) - (u,Au)(t) 

[{B m u m ,u m ) - (Bu,u) + {R m [u m },u m ) ~ {R[u],u) - {f,u m - u)\. 
Application of the algebraic identities stated above shows that the right-hand side is 



[(u m - u, B m (u m - u)) + (2u m - u, (B m - B)u) + 2(u m - u, Bu) 
(u m - u,R m [u m - u\) + {u m ,R m [u] - R[u\) + (u,R m [u m - it] - R[u m - it]) 
+(it m - u,i?[it]) + (u,R[u m - u]) + (f,u m - u)]. (52) 



Identities (|5T|) and (j52|) combine to yield 



(u m - u,A m (u m - u))(t) = - I (B m (u m - it) + R m [u m - u},u m - u) + g m {t), (53) 

-co 



in which g m G C (R) is defined by 



g m (t) = -(2u m (t)-u(t),(A m -A)u(t))-2((u m -u)(t),Au(t)) 
- / [(2it m - u, (B m - B)u) + 2(u m - u, Bu) 

J — oo 

+(u m ,R m [u} - R[u}) + (u,R m [u m -u]- R[u m - u\) 

+{u m - u,R[u\) + (u,R[u m - u]) + (f,u m - u)]. (54) 
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Since the B m 7 s are uniformly bounded operators on H and the i? m 's are uniformly bounded oper- 
ators on L 2 ((— oo, t], H) for every t G R (with norm independent of t), (|53p implies that 

\\u m - u\\ 2 (t) < C(u m - u, A m (u m - u))(t) < C J \\u m - u\\ 2 + \g m (t)\. (55) 

J — oo 

in which C means something different each time it occurs, as before, but depends on the quantities 
indicated the second section. 

Select To for which u(t) = u m {t) = for all t < To,m G N. Choose T G R. Application of 
Gronwall's inequality to (|55p yields, for Tq < t < T and C depending on T along with everything 
else, 

\\u m ~ U\\ 2 (t) < C T -T \g m \(t). (56) 

in which Ct~t depends onT — Tq in addition to the standard dependencies. 

It remains to see that g m (t) —> uniformly in t G [To,T], as m — > oo. The first term in 
(|54p tends to zero pointwise (in t) thanks to the assumption that A m — > A strongly, and to the 
energy estimate (Theorem [5]) which assures that || Uijyi U 1 1 IS bounded uniformly in m G N and 
t G [To,T]. On the other hand, Corollary U] and the uniform bounds on {A m : m G N} implied by 
the assumption that {A m , B m , Q m ) : m G N} C V(C*, C* , Cb, Cq) in turn imply that the first term 
in (|54p defines an equicontinuous sequence of continuous functions on [To , T] . Since any convergent 
subsequence converges pointwise to the zero function, so does the entire sequence, and uniformly. 
The second term in (|54p tends to zero, uniformly on [To,T], thanks to Lemma [21 the third because 
B m — > B strongly, the fourth, seventh, eighth and ninth because of Lemma [T] and Theorem [5j the 
fifth because R m — > R strongly in L^(R, H), the sixth because of Lemma [3l 

□ 

Remark. This result is sharp, in the sense that nothing stronger than continuity can be expected 
without additional constraints on the various components of the formal evolution problem ([9]). In 
particular, the modulus of continuity cannot be uniform in the right-hand side (/ G L 2 (R,H)), 
even locally. 

For example, the ID linear advection problem 



27 



conforms to the setting described above, with H = L 2 (R). The operator coefficients are: A = 
multiplication by the positive constant 1/c, P = d/dx, skew-adjoint with domain V = ff 1 (R), and 
B = 0, Q = 0. For any / € L 2 (R?) (= L 2 (R, H) by Fubini's Theorem), the causal weak solution is 



in which we have explicitly indicated the dependence of the weak solution on the coefficient 1/c 
and the right-hand side /. 

Suppose x £ Cq ^)) suppx C [—1,1], and 



For e > 0, set f e (t, x) = cos((x + t)/e)x{x + t)x{x). Then u[l, fe](t, x) = cos((x + t)/e)x(x + t) for 
t > 1, whereas integration by parts shows that 



for c / 1. Thus the modulus of continuity of (c, f t ) h-> u[c, f t ](t, ■) £ H (for t > 1) cannot be 
uniform over the bounded set [a, 6] x {/ e : e > 0} C R x L 2 (R, H), if 1 € (a, b), and in particular 
this map is not locally uniformly continuous. The heuristic reason is that c is the wave speed, so 
changing c changes the position of arriving waves. This position shift has unboundedly large impact 
if the frequency of oscillation in the solution is allowed to become arbitrarily large. Oscillations in 
the data in time translate into oscillations in the solution in space, of similar frequency, because 
the problem is hyperbolic. 

On the other hand, additional regularity in time of the right-hand side entails more regular 
behaviour of the solution, as one might expect since such regularity damps temporal frequencies. 

Corollary 7. In addition to the hypotheses of Theorem [7| assume that f G Hf oc (R,H), k > 1. 
Then for any choice of Tq < T 6 R ; 




(57) 





lim \\u m — u\\ C k-in T T ] H ) = 0. 



Proof. Follows directly from Theorems O and [71 



□ 



Proof, of Theorem O The meaning of (I14p is that for any <f> 6 C^°(R, V) 




(58) 
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On the other hand, both u and Uh, h > 0, satisfy (|10p with the same right-hand side, so 

= Ujiu^Arf + Pt-BM-Rttt]) 
- J(u, A(j)' + P</>- B*<f> - R*[^ 
= J(u h ,6A<// -8B*(f)-8R* 



+ j (^^^ M , + p ^_ B ^_ R * [(j)] y (59) 

Subtracting (I58p from (I59|) and rearranging, obtain 

/ ( \ h U ~ 5n ) ' ^ + P ^ ~ ~ R * = / ~ u ' ~ ~ 6R * ^ 

(5A(u h - it)' + 5B(u h -u) + 8R[u h -u],<f>). (60) 
In view of equation (|60p . the Newton quotient remainder 

Uh - u 

: OU 

h 

is the weak solution of ([9]) with right-hand side 

5A(u h - u)' + SB(u h -u) + 5R[u h -u]e Lf oc (R, H). 
In view of Corollary [7] and the energy estimate (Theorem [5]) imply that 

\\8A(u h - u)' + 5B(u h -u) + 5R[u h - u]^^^^^ _> 
as h — > for any T € R. The conclusion then follows from anther use of Theorem [SJ □ 

Remark. This result is also sharp, in the sense that the right-hand side must have at least one 
square-integrable derivative in t, if only additional regularity in t is to be imposed. For example, 
the solution (|57p of the linear advection equation presented above may be rewritten as 



u(t,x) = J f n+^—^,y\ dy, 



from which it is straightforward to see that no less regularity in t will do. On the other hand, 
the expression (|57p suggests that additional regularity in x might also support differentiate de- 
pendence on c. However this conclusion rests on a special feature of the example problem, namely 
that it admits a propagation of singularity principle (and indeed solution via the method of char- 
acteristics, an even more special property). Propagation of singularities along bicharacteristics 
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holds for symmetric or strictly hyperbolic systems with smooth coefficients ( see for example 



1984 



( 1981 )), and to some limited extent fo r 



Svmes 



1986; 



Lewis and Svmes 



systems with less regu 



1991 



Bao and Svmes 



ar co efficients (jBeals and Reedl . 



Taylor 



1982, 



1996). Stronger regularity results for 



dependence on coefficients follow for some of these systems. 

Proof, of Theorem [9j we present the case k = 2. The general case follows by an induction argument, 
which we omit. 

As usual, denote by To G R a lower bound for supp/. In the notation of the proof of Theorem 
1b£ C 2 (R,H) n C^R, V), thanks to Theorem [U Therefore the right-hand side of of dD has 
a locally square-integrable derivative, whence the causal weak solution 5u actually belongs to the 
class C 1 (R, H) n C°(R, V). Applying Theorems [5] and [6] repeatedly, one sees that 5u satisfies for 
any Ti G R 

ll<MU<»(pb,r 1 ],v) ^ c 't 1 -ToI|(^,^-B,(5Q)||p||/|| h2([ToiTi]) . 
It follows that the linear map DT[(A,B,Q)] : M -»■ C°([T ,Ti],W) defined by 

DF[(A, B, Q)\ (5 A, 5B, 5Q) = S5u 

is continuous. 

Suppose that (A m , B m ,Q m ) — > (A,B,Q) in norm (of M). Denote by 5u m the solution of 
(fT3|) with (A,B,Q) replaced by (A m , B m ,Q m ). It follows from the definition (equation (JHJ)) that 
5u m — 5u is the (strong) solution of 

A(Su m — Su)' + P(5u m — Su) + B(5u m — Su) + R[5u m — Su] 

= -[(A m -A)5u m +(B m -B)5u m +R m [5u m }-R m [5u}+5A{u m -u) l +5B{u m -u)+ (61) 

Theorem [6] implies that {5u m : m G N} is a bounded set in C 1 ((— oo, T\], H) n C ((— oo,Ti],V), 
whence in i?j^ c ((— oo, Ti], i7) also; moreover supp<5u m C [To,oo) for all m G N. It follows that the 
first three terms on the right-hand side of f|61|) tend to zero in i^ 1 ([To, Ti], -ff). Applying Theorem 
[7] to u' m — u', the difference of solutions of equations of the form ([9]) with the same causal right-hand 



side /' G H^ oc (R,H), one sees that 



in H ([Tq } Ti], i?) also, whence the second three 



terms also tend to zero in this sense. Thus the entire right-hand side of EH tends to zero as m — > oo 
in the sense of JT 1 ([To, Ti], H). Now it follows from Theorem [U] that 



{DF[{A m , B m , Q m )\ - DF[(A, B, Q)])(5A, 5B, 5Q) = S{5u m - Su) -> 
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(in the sense of C°([T 0j T x ], W). That is, DT : V x X -»• C°[T , Ti], IT) is continuous, and the 
theorem is proved in the case k = 2. 

□ 



SYMMETRIC HYPERBOLIC SYSTEMS: CONTINUOUS DEPENDENCE, 

PROOFS OF THEOREMS ?? - ?? 

For convenience, we repeat the key definitions from the second section. 

Recall that k x k symmetric hyperbolic systems take the form 
du 

a— + p(V)it + bu + q * u = f in x R; u = for t < 0, (62) 

in which the coefficient matrices a, 6, and q are k x k, and the A; x fc matrix differential operator in 
the "space" variables x 6 SI has symmetric and constant coefficient matrices. 

The set of admissible coefficients M C L°°(ft, Rj**J x L°°(0, R fexfc ) x L X (R + , L°°(fi, R^J) 
is defined by 

M(C*, C*, C B , C q ) = {(a, 6, g) : CJ < o(x) < C*I for all x e 0, ||6|| L -(n,R^ fc ) < C Bj 

lkllLi(R f ,L«(n I H^Jy) < C Q>l(t) = for t < 0}, 
M = IJ{M(C # ,C**,C* B) C q ) : 0<C*<C'*,C b ,CqGR + }. (63) 
M corresponds to a collection of problems of the form (|62p . with common p(V). 

The Hibert space of states is H = L 2 (£l) k . Provide p(V) with a dense domain V C H, and 
assume the p(V) : V — > H is skew-adjoint, and that the norm in V is equivalent to the graph norm 
ofp(V). 

The hypotheses of Theorem [5] are all satisfied, so 

Corollary 8. Suppose that (a,b,q) £ M, and f £ Lj oc (R, H) is causal. Then there exists a unique 
weak solution of ([62]) . satisfying for every 4> £ -f^ 1 (R, L 2 (Q) k ) 

/OO P 
dt / dx [u(-A<j>' - p{V)<p + b(f> + q * <f>) - f<j>] = 0. 
-oo J C 
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There exists an increasing function C : R+ — > R+, depending on C*,C*, and bounds for b,q, so 
that for every t G R, 

12 .- /'<m / ./-II r, -,ll~ ( 64 ) 



K,t)||i w <C(t) / dr||/(-,r)|| 



Theorems Q] and [2] now follow directly from Theorems [8] and [9] respectively. 

Multiplication by a member of L°°(Cl) defines a continuous map from L°°(f2) to B(L 2 (il)), 
so convergence of the coefficients a,b,q in L°°(0) is sufficient to induce uniform, hence strong, 
convergence of the corresponding operators, hence convergence of the solutions per Theorem [7J 
However, convergence in a weaker sense is also sufficient to induce stro ng operator convergence. 



The key observation is the following result, identical to Lemma 2.8.5 in ([Stolid . 



20001 ): 



Lemma 4. Let (Q,A,/i) be a measure space, {r m } C L°°{Q.,jj) with ||r m ||£oo(^ ^ < R G R+, 
{/m} C L 2 (Q,,n) with \\fm\\L 2 (n,^) < F G R+ for all m G N. Suppose that r rn — > m [i-measure. 
Then for any g G L 2 (Q,/j,), 



limm^oo / r m f m gdn = 0. 



(65) 



Proof. Suppose on the contrary that such sequences {r m },{f m } and square- integr able g exist, also 
an rj > 0, for which the left-hand side of (|65p remains > r/ along a common subsequence. Without 
loss of generality, renumber the subsequence so that 



fmgd/j. 



> r], m G N. 



(66) 



Convergence in measure of {r m } means that for any e > 0, 

fi[E e (r m )] — > as m — > oo, where E € (r m ) = {x G E : |r m (x)| > e}. 
Choose e so that e-F||(?||L2(Q jAt ) < rj/2. 

From this definition and the Cauchy-Schwarz inequality, one sees that 



' m f m gdfi 



<e / \f m g\dfj, + R / \f m g\dfi 

E\E e (r m ) JE e {r m ) 



~ 2 d{i <\ + RF 



<tF\\g\\LHE,B,») + RF\ / g 

\JE e (r m ) 

By passing if necessary to a further subsequence, we may assume that 



g 2 d[i 



E e (r m ) 
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Thus the characteristic functions of the sets E e (r m ) are almost everywhere convergent to zero as 
m — > oo. Since |<7| 2 G L (f2, /i), it follows from the Lebesgue Dominated Convergence Theorem that 
for large enough m, 



9 2 dA < 11 



2RF 



Thus the left-hand side of (|66p can be made smaller than 77, a contradiction. □ 

Lemma 5. Suppose that {a m }^ =1 C L°°(R n ) converges in measure to a G L°°(R n ), and iaai 
A m ,m G N and A G i3(L 2 (R n )) are defined by 

(A m u)(x) = a m (x)u(x), (A«)(x) = a(x)n(x), it G L 2 (R n ). 

Taen A m — )■ ^4 strongly. The same is true for similar sequences of operators on L 2 (R n ) p defined 
by sequences of p x p matrix-valued functions whose components converge in measure. 

Proof. In fact, the operators so defined are self-adjoint, and 

\\A m u - Au\\ 2 = / (a m - a)[(a m - a)u]u ->■ 0, 



as follows from Lemma U taking a m — a for r m , (a m — a)u for f m , and u for g in the notation of 
that lemma. □ 

Proof, of Theorem [3j follows immediately from Lemma [5] and Theorem [71 □ 

One way to obtain convergence in measure is via mollification. Let {r] m : m G N} be a Dirac 
sequence of mollifiers, as in the proof of Proposition [2j Since a G L^ oc (R d ) for any p, rj m * a — > a 
pointwise almost everywhere, hence in measure. An application of this observation is the finite 
speed of propagation property: for solutions of (|62j) . support expands at finite spee d. This pro perty 



Lax! (|200fih . Ch. 



is well-understood for hyperbolic systems with smooth coefficients: for example, 
4, presents a proof of: 

Proposition 3. In addition to the hypotheses of Theorem{T\ suppose that the coefficients in ([62 
are smooth (of class C°°(fi) / ). Then 

1. the causal weak solution u G L 2 oc (R, H) is smooth: u G C 00 (R n+1 ) p ; and 
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2. if<p€ C°°(R n ) satisfies 

Edd> 

and supp(/) f|{(x, t) : </>(x) > t} = 0, then u(x,i) = if <j)(x) < t. 
Corollary 9. In the setting of Proposition^ suppose that r G R satisfies 

n 

ra(x) + ^^>0, xeR", |£| = 1. 

If xq G R n , to ^ R satisfy 



i=l 



/(x, t) = if r|x - x | + t - * > 0, 



then u(xq, to) = 0. 



Note that the argument given in (ILaxl . 



(67) 



20061 ) does not quite encompass Proposition [3] and 



Corollary [9j it does not apply to systems like (|62p with memory terms. However the extension is 
straightforward. 



Proof, of Theorem U) Using a sequence of mollifiers as before, obtain a sequence {(a m ,b m ,q m ) : 
m G N} of smooth coefficients converging in measure to (a, b, q). Then the corresponding operators 
converge strongly to those induced by (a,b,q). Since ([5]) holds almost everywhere, it follows that 
a m satisfies the spectral inequality (|67p in R n for sufficiently large m, whence Corollary [9] implies 
that u m vanishes in uj. Since, u m — > u in L 2 (Q) k the conclusion follows. □ 



AN EXAMPLE: VISCOELASTICICTY 



The dynamic equations of linear viscoelasticity may be written as 

p m = v - fT + f ' 



(68) 



in which v is the particle velocity field, a the stress tensor , f a body for ce density, p the mass 



( Christensen , 


1983; 


Pipkin , 


1986) 



19861 ). Viscoelasticity differs 



from elasticity in that the inverse Hooke operator is a causal convolution operator (in time), rather 
than a temporally local multiplication operator. It follows from (|68p that the strain rate (right- 
hand side of the second equation) is the convolution of the stress with the indefinite time integral 
of r. 



34 



We will assume instantaneous elastic response: a nonzero strain rate arises immediately from a 
stress impulse. Under this assumption, the kernel T can be decomposed as 

r(t) = r e «5(t)+ 7 (t), 

in which T e is the elastic inverse Hooke tensor (inverse of the unrelaxed modulus) , and 7 is a causal 
kernel. Both the elastic kernel T e and the memory kernel ~f(t) act as spatially-variable, symmetry- 
preserving linear operators on symmetric tensor fields. The conventional representation of such 
things by 4-index tensors, 



r e - ( r ijfci)|- jVM=1 > 7- hijki)\l jAl=l , 



thus entail the symmetries 



^ijkl — Fjikl — ^ijlk — ^kliji hj> k,l — 1, 2, 3, 



(69) 



and similarly for 7. 

To avoid technical complications, assume that the viscoelastic material occupies all of R 3 . We 
require that, for some < g* < g* , 



1. r e is elliptic: for any symmetric a G R 3x3 , 

gAWW < ll re ( x )°1l < 9*\W\\, x e r3 ; 



(70) 



2. Pa»(R 3 ,6(R^ m )); 

3. 7e^ 1 ' 1 (H.^°°(R 3 ^(Rs 3 y x im))) 



For the "state space" H of the viscoelastic system we choose H = L 2 (R 3 , R 9 ) = L 2 (R 3 , R 3x ' 



syrava) 



L 2 (R 3 ,R 3 ). The inner product in H is defined by 



(ui,u 2 ) = / trofo-2 + vfv 2 , 
iR3 



/ \ 

a 



u 



The assumptions 1-3 above and the symmetries (J69J) imply that 



Au 



( \ 



G H 
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defines a bounded, self-adjoint positive-definite operator A G B(H). 

Define the differential operator p(V) : C£°(R 3 ,R 9 ) -»■ C^°(R 3 ,R 9 ) by 

i(Vv + Vv T ) 
V-<7 



p(V)u 



V 





/ \ 




a 






/ 





,- poo/p3 "d9\ — /^^/TjS t>3x3 \ v ,^oo/t>3 to 
G O (K , K j = Oq (K , K s mm J X C ( It It 



p(V) is antisymmetric and densely denned in H. Define 

• H\ iv (Q, Rgymm) to be the subspace of L 2 (Q, R 3 ym m ) consisting of square-integrable symmetric 
matrix valued functions, each column of has a square-integrable divergence; 

• Hf iee (Q) to be the subspace of vectors in a G H^ iv (£l, Rgymm) satisfying 



cr(x) • n = 0, a. e. x G d£l; 



v = Hl ee (nfxH 1 (nf. 



An argument similar to that explained in Appendix A shows that p(V) has a self-adjoint extension 
P : V —> H, and that the natural norm on V is equivalent to the graph norm of P. 



Let 



hm 7 (V) e ^(R^(R^4J), 



and 



Then 



q = ( lim l [t)00) |l G L 1 (R,L 0O (R 3 ,e(R s 3 y ^ m ))). 



5o " , 
7 * — = ocr + q * a. 

at 



Define B G B(H) and Q G L 1 (R, B{H)) by 

= , Q(t)u = 

V / 

Finally, define / G Lj^R.tf) by / = (0,f) T . 



K J 



( \ 



a 



With these definitions, the system (|68p is formally equivalent to the evolution problem Q. 
The theory developed here thus assures the existence of weak solutions of (|68p . in material models 
including discontinuities of densities and/or elastic moduli and/or relaxation moduli. 

An immediate consequence of Theorem [5] is 
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Corollary 10. Denote by c v the maximum quasi-p-wave velocity of the viscoelastic system (|68p . 
defined as 

c p = ess sup{A max (r e (x)[^ T ]/p(x) : x,£ G R 3 ,^ = !}• 
Suppose that (xo,io) satisfies 

|x - x | > c p (t - *) 

/or every (x, i) G suppf. T/ien i/te causal weak solution (er, v) 0/ (|68|) vanishes in a neighborhood 
of (xo,to)- 

DISCUSSION 

The theory developed in the preceding pages provides a basic framework for inverse problems 
defined by symmetric hyperbolic systems. However a number of important issues remain to be 
addressed. We shall describe some of these, and some implications of our theory in each case. 

Descent directions 

We have shown how to formulate the "forward map" (J^f^m i n the notation of Theorem |2| as a 
continuously differentiable map on an open subset of a suitable Banach space, with values in a 
(subset of a) suitable Hilbert space. Thus least squares objective functions for such problems are 
continuously differentiable in a well-defined sense. However, the domain metric specified by the 
theory is some version of L°°, in the case of concrete problems of form ([1]), or the operator norm 
in the case of abstract problems (Theorem [9|) . Thus the ambient Banach space is in all cases 
nonreflexive, and does no t have the properti e s requ ired for a sensible definition of gradient, as 



explained for instance by 



Kaltenbacher et al 



(2008). Thus differentiability does not necessarily 



bring with it natural access to descent directions. 

Additional information about the derivative is available, however, via the well-known adjoint 
state method. For brevity, we explain this construction in the case of the abstract evolution problem 
§§§ in the differential case and without lower-order term, that is, B, R = 0, and we proceed formally. 
With notation as in the statement of Theorem [UJ we define 

F Lm :Bt ymm (H)^L 2 ([0,T],W) 
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by 

Ff,m[A] = S[m]u, 
where u £ C l (R, H) n C°(R, V) is the solution of 

Au' + Pu = f G fl£ c (R, H), f{t),u(t) = 0,t<0 

guaranteed by the theory. If k > 2, then \calFj m is of class C , and its derivative is given by 

D7) )m [;l]&A = S , [m](5«, 



where 



ASu 1 + P5u = -<W, Su(t) = 0,t<0 



Suppose that we are provided data d E L 2 ([0,T],W). The adjoint state method (|Plessbd . 120061 ) 
represents the derivative of the least-squares function 

J firn [A;d] = ^\\T Lm [A\-d\\ 2 



as follows. Suppose that w is a strong solution of 



A'w + Pw = S[m]*(d- Ff tm [A\); w{t) = 0,t>T. 



(71) 



Then for any 6 A £ B symra (H), 

DJf,m[A; d]5A 



r 



dt(5Au'(t),w(t)}. 



(72) 



It is possible to extract a gradient, that is, a direction of fastest ascent for J/, m , from this repre- 
sentation of the derivative. Write (for u, v £ H) u®v for the rank 1 member of B{H) defined by 
u <S> v(w) = (v, w)u. Such rank-1 operators are very special instances of operators of trace class. 



We refer to 



Conway 



(|l990l ). pp. 267-268 and 275 for the properties of this subspace of B(H) cited 
here, in particular these: the product of a bounded operator and a trace-class operator is of trace 
class; a real- valued linear trace function tr is defined on the trace class, generaizing the trace of 
matrices. The adjoint-state expression ([72]) is equivalent to 

fT 



DJ f , m [A;d]5A = tr fsA J dtu'(t)®w(t) 



(73) 



well-defined since the integral inside the parenthesis defines a trace-class operator. Ths trace of the 
product defines a duality pairing, expressing the trace class as the "predual" of B(H). 
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Assuming that /, d ars sufficiently smooth in t, the integrand in (|73p is well-approximated by 
quadrature: 

/ [T/At] \ 

DJ fjm [A;d]5A = tr MAf ^ u'(nAt) ® w(nAt)\ (74) 
V "=° / 

Linear functionals on B{H) of the form given by the right-hand side of (|74p form precisely the 

subset of the dual B{H)* consisting of functionals continuous in the weak operator topology. Since 

the unit ball of B(H) is compact with respect to weak convergence, the functional defined by the 

right-hand side of (|74p has a maximizer over the ball, which is a candidate for an approximate 

gradient of Jf, m at A. 

This observation remains correct if A and 5 A are restricted to subspaces of B{H), for example 
L°° matrix- valued functions as in the definition of symmetric hyperbolic systems (pQ). 

The program outlined in the preceding paragraphs leaves a number of details to be filled in. 
For example, actually finding the maximizer of the linear functional in (|74|) is in some sense a 
finite dimensional problem, but not a particularly simple one. Also, the precise relation between 
right-hand sides of (|T3[) and (|74p remains to be established. Finally, the adjoint state evolution 
problem (fTTj) involves a right-hand side (S[m]*(d — F m j[A]) £ L^ oc (R, V*) outside of the class for 
which solutions, strong or weak, have been shown to exist. We will address this latter problem in 
the next subsection. 

We also note that the gradient, defined as a direction of fastest ascent, is not smooth, or even 
necessarily a function, even under hypotheses which guarantee that <//, m has several derivatives. 
This observation leads to v arious developmen t s in c onvex optimization, beyond the scope of this 



paper. We note merely that 



Bamberger et al 



(|1979l ). a landmark early paper on inverse problems 



in wave propagation, addressed this point explicitly. 



Sharpness of trace regularity and source definition 

Theorem [2] gives sufficient conditions for the trace of a solution of the symmetric hyperbolic system 
to be well-defined and depend smoothly on the coefficients and right-hand side in ([1]). However 
these conditions are only sufficient, not necessary. Under some conditions, the trace of even a weak 
solution may be well-defined and depend continuously on the coefficients and right-hand side. 
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For example, suppose that the coefficients are smooth in a neighborhood u of the boundary 
<9f2 and containing the measurement surface T, that the right-hand side is supported outside of 
this neighborhood, and that no rays of geometric optics originating outside this neighborhood 
are tangent to the boundary. Then microlocal propagation of regularity shows that the the map 
/ e H^([0,T],L 2 (n \ u) k ) -> L 2 (\0,T},L 2 (T) 1 ) extends continuously t o a map L 2 ([0,T], L 2 (0 \ 



u) k ) -> L 2 ([0,T],L 2 {T) 1 ). See for example rtSvmes! . 



1983 



Lasieckal . 



1986). 



By duality, sources supported on T also give rise to well-defined solutions with estimates similar 
to those proven in the body of this paper. Combining these two ob s ervat ions leads to definition of 



variants of the Dirichlet-to-Neumann map ([Sylvester and Uhlmannl . I199Q ) which forms the natural 
abstraction of "data" for many inverse problems. These mappings, from suitable (source) data 
supported on T to other data supported on T ( or on another similar surface), thus inherit a 
definition and regularity properties for minimally regular coefficient classes as described above. 



Nonphysical extension via operator coefficients 



Perhaps surprisingly, the results on systems with operator coefficients (Theorems 5-9) seem likely 
to be useful in themselves. This utility originates in the resistance of inverse problems for hyperbolic 
systems to the least-squares (or least-error) approaches that have successfully treated many other 
types of inverse problems in science and engineering. Natural objective functions change rapidliy 
in some directions, not in others, and appear to possess many stationary points far from any useful 
model estimate, a nearly fatal feature for variants of Newton's method , ofte n the only feasible 



approach to model estimation (jGauthier et al 



1986; 



Santosa and Svmes . 



19891 ). 



The last-named author has suggested that use of operator coefficients as an exte nsion of norma l 



continuum physics could conceivably convexity the normal least-squares objective (jSvmesl . 120081 ). 
Far from an arbitrary introduction of additional degrees of freedom, this reformulation of the inverse 
problem is closely relate d to well-tested ideas in seismic data pro cessing, and has already a partial 



theoretical justification (jStolk et al. 



2009; 



Shen and Svmes . 



2008). Very recently, implementations 



of exte nded least squares inversion using operator coefficients have produced very promising early 



results (jBiondi and Almomin . 



20121 ) which tend to support the convexification hypothesis. 
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And so on.. 



The fundamental model problem (pQ) does not include static differential constraints, so the results 
established above do not apply directly to Maxwell's equations, for example. We suspect that the 
theory can be extended to accommodate such problems. 

Imposition of additional regularity requirements on coefficient matrices, beyond boundednes s 



and measurability, leads to existence of well-behaved solutions with less regularity. IStolk 



(|2000h 



offers some results concerning second-order problems with additional regularity of both coefficients 
and data. 



Both physical heuristics and numerical evidence for inverse problems in seismology (IVirieux and Qpertd . 



2009) suggest strongly that the mapping T is more linear when the right-hand side is smoother in 
time. Estimates similar to those developed in the proof of Theorem 9 show that for right-hand sides 
of the form f(et), the Newton quotient remainder is 0(he) relative to the directional derivative, for 
any (5A,5B,5Q) not in the null space of DT . This observation can be developed into a theoreti- 
cal justification of the frequency continuation strategy employed in every successful contemporary 
algorithm for least squares inversion of seismic data. 



CONCLUSION 

This paper has demonstrated that the relation between coefficients and solutions of symmetric hy- 
perbolic integro-differential systems has many of the regularity properties assumed (usually without 
comment) in the applied literature on inverse problems in wave propagation, while allowing for a 
degree of irregularity in the spatial dependence of coefficients which seems adequate to model the 
heterogeneity observed in real materials. The methods of proof are classical, dating to the middle 
of the last century, yet so far as we are aware these results have not previously been explicitly 
formulated or collected in one place. Several by-products of our development are interesting in 
themselves: finite speed of propagation for waves in highly heterogeneous materials, a theoretical 
framework for the systems with operator coefficients which have recently arisen as a key element 
of convexifying relaxations for waveform inversion, and the clear necessity of developing descent 
algorithms for non-reflexive Banach metrics, for example. We have also pointed out a number of 
directions in which the theory suggests extensions, or poses open questions. Indeed, we expect that 
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our results will serve as an initial step in the development of a solid mathematical foundation for 
inverse problems in heterogeneous media. 
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APPENDIX A 



THE SKEW-ADJOINT PROPERTY OF THE ACOUSTIC GRAD-DIV 

OPERATOR 

We establish skew-adjointness of the operator D, defined formally in (J7|), by appealing to an auxiliary 
result, similar to necessary and sufficient cond itions for self-adjointness found in many texts on 
functional analysis (for example, Conway ( 199ol )). 



Lemma 6. Suppose that H is a Hilbert space, V C H a dense subspace, and L : V —> H a skew- 
symmetric linear operator. Then L is skew-adjoint if and only if for any c £ R \ {0}, L + cl is 
surjective. 

Proof. Suppose first that L is skew-symmetric and L + cl is surjective for every c G R \ {0}. Since 
the domain of L is dense, for any such c, 

ker(L* + cl) = rng(L + cI) L = {0}. 

Choose x £ V(L*), and c£R,c/0. Then there must be y G V so that (L — cl)y = (L* + cl)x. 
However, since L is skew-symmetric, V C T>(L*) so (L — cI)y = —(L* + cI)y = (L* + cl)x. However, 
we just saw that L* + cl is injective, so x = —y £ V. Thus V = V(L*) whence L is skew-adjoint. 

Conversely, if L is skew-adjoint, there can be no nonzero solutions to (L+cI)x = with nonzero 
c, as follows from skew-symmetry. Hence there are no nonzero solutions to (L* + cl)x = with 
nonzero c, since L = L* . That is, the kernel of L* + cl is trivial for no nzero c, whenc e the range of 
L + cI is dense for any nonzero c. However L is closed (see for instance 



Conway 



(1990), Proposition 



X.1.6, p. 305) , so its range i s closed (this is proved just as is a similar fact for closed symmetric 



operators, see 



Conwavl (|1990h . Proposition X.2.5, p. 310) thus rng(L + cl) = H. 



□ 



Proposition 4. The differential operator p(V) with domain C^°(0,) 4 , defined formally by (J7J), 
extends to a skew-adjoint operator P on H with dense domain 



V = H'(n)xH 1 div (Q). 
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Remark: The Hilbert space H^ iv (Q) is the dense subspace of L 2 (Q) 3 obtained by completing 
C 1 (0) 3 in the graph norm of the divergence operator. Equivalently, v £ L 2 (fi) 3 belongs to H^ iv (£l) 
if and only if there exists C > so that for every (ft £ Hq(Q), 

|(V0,V) L 2 (Q) 3| < C||0|| Z 2 (n ). 

Proof. Prom the definitions, p(V) extends to P : V — > L 2 (Q) 4 . According to Lemma [U it suffices 
to show that for (q,w) T € H , c € R\ {0}, there exists (p, v) T G V for which P(p, v) T = (g,w) T , 
that is, 

— Vp + cv = w, 
-V-v + cp = q. (A-l) 



The first equation is equivalent to the requirement that for <j) £ Hq(CI), 



-(V<^,Vp) +c(V</>,v) = (V&w). 



To satisfy the second equation in (| A- 1 [) . v G ^iv(^) necessarily, and 



i V • v) = (<£,<?- cp). 



Thus (|A-ip implies that 



( V<£, Vp) - c 2 (</», p) = ( V</>, w) - c(0, g) . 



(A-2) 



The left-hand side of (|A-2|) defines a bounded and coercive (negative-definite) form on Hq(Q) for 
any nonzero c, and the right hand side defi nes a continuous linear form on the same Hilbert space. 
The Lax-Milgram Theorem ( Yosida ( 19961 ) . III. 7) implies the existence of a unique p E Hq(CI) for 
which (|A-2p holds for every G 



With this choice of p, we are required to solve the second of the two conditions (|A-ip . In fact, 
a solution vq E H 1 ^) 3 C H^ iv (Q) exists satisfying 



l v o||ffi(n) 3 < C|k - c p\\l 2 (q) 



with a constant C depending only on fi. For a proof, see 



Brenner and Scottl (|2007l ). Lemma 11.2.3, 



who also give references to other results for domains for polygonal, rather than smooth, boundaries. 
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Set vi = c _1 Vp - v + c -1 w G L 2 (tt) 3 . Then for any (f> G 



c(V0,vi) 



(V0, 



Vp + CVq — w) 



(V0, 



Vp — w) — c(</>, V 



Vo) 



(V0, 



Vp - w) + c(</>, q 



cp) 



= 



(A-3) 



thanks to (|A^2|) . That is, vi is divergence-free, in the sense of distributions, and in particular 
vi G H^ iv (Q). Now set v = Vo + Vi. Then the first of the two conditions in ()A-1|) is satisfied by 
(p, v) thanks to the definiton of vi , whereas the construction of vo and the divergence- free property 
of vi imply that the second condition is also satisfied. 

Thus we have constructed a solution of (|A-1|) in V = Hq(Q) x H^. v (Q) for any c G R \ {0}, 
whence we conclude that P is skew-adjoint. □ 



If the memory term (convolution operator R) is absent, then initial data determine solutions 
uniquely. In this section, we sketch the theory, parallel to that for causal solutions, which holds in 
this differential case. We assume throughout this section that Q = 0. 

Corollary 11. Suppose that u G Lf oc (R, H) is a weak solution of (OJ). Then u G C°(R, H). 

Proof. Choose cf> G C°°(R) so that <j)(t) = 1 for t > 1 (say), and <j)(t) = for t < -1. Set u + = 4>u, 
U- = (1 — <p)u. It is straightforward to verify that u+ is a causal solution of @ with / replaced by 
/ + (j)'u G L 2 (R, H), whence u+ G C°(R, H) according to Corollar}{31 Likewise t \— > n_(— t) is also 
a causal solution of ([9]) with D, B replaced by —P, —B, and / replaced by t \-t —f(—t) + c/>'u(—t), 
which also belongs to L 2 (R, H). Thus U- is also continuous, but u = u + + u_. □ 

Corollary 12. Suppose that u is a weak solution of Q. Then for any s < t G R, 



APPENDIX B 



THE DIFFERENTIAL CASE 
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Proof. The energy identity (|34p applies to weak solutions, causal or not. Take into account R = 0, 
and use Gronwall's inequality, the boundedness of B, and the equivalence (|18p of the energy with 
the norm in H. □ 

Corollary 13. Ifu\ and ui are weak solutions of ([9|) (with the same right-hand side f € L 2 (R, H) ), 
and u\(s) = U2(s) for some s£R, then u\ = U2- 

Proof. Set u = u\ — U2'- u is a weak solution with right-hand side / = 0, and u(s) = 0. The result 
follows immediately from Corollary 1121 □ 

Theorem 10. Suppose that Tq £ R and uo G H. Then there exists a unique weak solution of (jSJ) 
for which it (To) = no- 

Proof. A proof of this result is precisely analogous to the proof of Theorem [5j the solution is 
approximated by a Galerkin procedure and the solution of systems of ordinary differential equations, 
and the energy in the error estimated (in this instance, via Corollary 112ft . □ 

Results precisely analogous to those established in the last section hold concerning regular 
dependence on the coefficient operators for weak solutions with specified initial data and right- 
hand side. We leave the reader to formulate these results, whose proofs are minor variants of those 
given above. 
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